Evergreen or Ephemeral: Predicting Webpage Longevity Through Relevancy Features
Abstract
With the rapid proliferation of user-generated content on the Internet, one of the biggest challenges is determining the relevancy of the information shown. Content often falls into one of two camps: ephemeral or evergreen. Evergreen content, such as a carrot cake recipe or an introduction to data structures, rarely changes with time, whereas ephemeral content, such as celebrity hot-or-not trends or local high school sports scores, quickly becomes dated. Unlike apps built around ephemerality, such as Snapchat, the Internet doesn’t have the luxury of assigning expiration dates to content. Humans can easily distinguish one from the other, but machines have yet to do so.
Similar resources
Classifying Ephemeral vs Evergreen Content on the Web
One of the strengths of the internet is the proliferation of content available on virtually any topic imaginable. The challenge today has become sorting through this wealth of content to locate the information of greatest interest to each user. Many sites today implement recommender engines based on expressed and learned user preferences to direct users towards new content that the engine belie...
Learning to Analyze Relevancy and Polarity of Tweets
This paper describes the participation of Oxyme in the profiling task of the RepLab workshop. We use a machine learning approach to predict the relevancy and polarity for reputation. The same classifier is used for both tasks. Features used include query dependent features, relevancy features, tweet features and sentiment features. An important component of the relevancy features are manually p...
A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection
Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...
A Cost-Benefit Analysis of Leaf Habit and Leaf Longevity of Trees and Their Geographical Pattern
To maximize net gain of a tree, leaves must be replaced when net gain of a leaf per unit time over the leaf's life span is maximum. A model in which leaf longevity is determined to maximize the net gain of a leaf per unit time is constructed. The model predicts that leaf longevity is short when initial net photosynthetic rate of the leaf is large, long when the construction cost of the leaf is...
Evaluating Query-Independent Object Features for Relevancy Prediction
This paper presents a series of experiments investigating the effectiveness of query-independent features extracted from retrieved objects to predict relevancy. Features were grouped into a set of conceptual categories, and individually evaluated based on click-through data collected in a laboratory-setting user study. The results showed that while textual and visual features were useful for re...
Publication date: 2014